(2.2.3.2) Optimism in face of uncertainty
2.2.3.2 Optimism in face of uncertainty
The principle of "optimism in the face of uncertainty" is proposed to solve the exploration-exploitation trade-off. Many algorithms in reinforcement learning that solve the tradeoffs follow this principle. (*8) In the case of optimistic misunderstanding, they play that slot machine again. After several times of play. They find that the probability is not as good as the experience.
They do not have a chance to correct pessimistic misunderstandings, but they have a chance to correct optimistic misunderstanding later. In other words, pessimistic misunderstandings are more severe than optimistic misunderstandings. Therefore, by making judgments optimistic, they are well-balanced. It is the principle of "optimism in the face of uncertainty." ----
----
Footnote:
en.icon